Are priors responsible for cosmology favoring additional neutrino species? 



O . 
(N ■ 

^ ■ 

^ ■ 
(N ; 

u: 
^' : 

Oh- 
I < 

o ■ 



> 

in 
o 

o 



Alma X. Gonzalez-Morales^, Robert Poltis^, Blake D. Sherwin'^, Licia Verde^ 
^ Instituto de Ciencias Nucleares, Universidad Nacional Autonoma de Mexico, Apdo. 70-543, 
CU, 04510 Mexico D.F. Part of the Collaboration Instituto Avanzado de Cosmologi'a. 
' HEPCOS, Department of Physics, SUNY at Buffalo, Buffalo, NY 14260-1500, USA. 
^ Department of Physics, Jadwin Hall, Princeton University, Princeton, NJ 08544, USA. 
^ ICREA & ICC, University of Barcelona (lEEC-UBj, Marti i Franques 1, Barcelona 08028, Spain. 

(Dated: June 28, 2011) 

It has been suggested that both recent cosmological data and the results of flavor oscillation experiments 
(MiniBooNE and LSND) lend support to the existence of low-mass sterile neutrinos. The cosmological data 
appear to weakly favor additional forms of radiation in the Universe, beyond photons and three standard 
neutrino families. We reconsider the cosmological evidence by making the resulting confidence intervals on the 
additional effective neutrino species as prior-independent as possible. We find that, once the prior-dependence 
is removed, the latest cosmological data show no evidence for deviations from the standard number of neutrino 
species. 



I. INTRODUCTION 

The standard model for particle physics has three massless 
neutrinos. Beyond the standard model physics is needed to 
give neutrinos a non-zero mass and hence explain measure- 
ments of neutrino oscillations. It is therefore reasonable, when 
considering extensions of the standard model, to explore the 
possibility of more than three neutrinos species. A deviation 
from the standard number of neutrino species N^^ ~ 3 
affects the expansion history of the early universe, which 
in turn modifies nucleosythesis constraints. Observations of 
the Cosmic Microwave Background (CMB) and of large- 
scale structure also probe the radiation density at matter- 
radiation equality and at recombination; this physical energy 
density in relativistic particles can be expressed in terms of 
the energy density in photons (which is highly constrained by 
the measurement of the CMB temperature) and the effective 
number of neutrino species A^cff which, even in the standard 
scenario ll28ll . differs from the number of neutrino species 
Ni, to account for QED effects and for neutrinos being not 
completely decoupled during electron-positron annihilation 
(see iQll for a review and refs. therein). Any light particle 
that does not couple to electrons, ions and photons, such 
as a sterile neutrino, will act as an additional relativistic 
species. Deviations from the standard A^cff value can also 
arise from decay of dark matter particles, quintessence, light 
axions or other exotic models Nucleosynthesis and 

CMB constraints on Ni, rely on completely different physics 
and coiTespond to very different epochs in the Universe's 
evolution, providing an overall consistency check. 

Recently, there has been renewed interest in the deviations 
for the standard number of neutrino species, see e.g., 
Extra neutrinos could explain for example the LSND results 
and some of the latest MiniBoone results The anoma- 

lous X-ray narrow emission feature from the dwarf spheroidal 
galaxy Willman 1 has also been explained in terms of sterile 
neutrinos 101 ■ 

The current situation is summarized in Ref. jsll. The 
authors consider the latest cosmological constraints on the 
number of neutrino species and their possible connection 
with particle physics experiments. They find that recent 



cosmological data, alone or in combination with big-bang 
nucleosynthesis constraints, weakly favor extra radiation in 
the Universe beyond photons and ordinary neutrinos, lending 
support to the existence of low-mass sterile neutrinos. They 
find that the amount of additional radiation corresponds to one 
extra N^s- An extra species of low mass sterile neutrinos 
would fit both cosmological data and flavor oscillation results. 

Note that in some analyses of cosmological data the stan- 
dard iVoff value is ruled out at better than 95% confidence, e.g. 
1^, while in other analyses it is not, e.g. |flot]. More recently, 
the latest CMB temperature data, including high-resolution 
measurements of the damping tail, were combined in Ref. ifTHl 
to show that this preference for extra neutrino species in 
the cosmological data comes from their contribution to the 
expansion rate prior to recombination. This analysis also rules 
out the standard neutrino number, A^cff , at the 95% and 98.4% 
confidence level with data sets consisting of CMB data and of 
a combination of CMB, Large-scale structure and the Hubble 
constant respectively. The physical interpretation offered is 
that a larger value of A'eff produces an increased SiUc damping 
at high £ and this seems to improve the fit to the data. 

Given the importance of the consequences of a deviation 
from the standard effective number of neutrino species and the 
reported hints for extra radiation from precision cosmological 
data, we reconsider the cosmological constraints on A^oft- 



n. METHOD 

Early work (ll9l [Tol[T3l-fl6ll and references therein) as well 
as more recent papers llll [l2ll have reported constraints 
derived in the Bayesian framework. In this framework con- 
straints are obtained from the posterior distribution, which is 
marginalized over the uninteresting parameters to derive the 
desired confidence intervals. While this approach is extremely 
powerful and enables one to do statistical inference, the 
posterior distribution depends on the priors chosen for each 
of the parameters, and this choice is often arbitrary. Further- 
more, when deriving confidence intervals, the marginalization 
procedure weights parameters by the posterior volume, rather 
than just by how well the parameters fit the data. When 
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considering a high-dimensional parameter space, especially 
if the data-sets are not highly constraining, the reported 
confidence regions on a parameter can depend strongly on the 
prior choice and on the resulting prior volume effects. 

Moreover the widely used software "Get Dist" - part of the 
CosmoMC package 111711 - provides by default the Bayesian 
central credible interval while the minimal credible interval 
may be more suitable for statistical inference. As noted in 
lUSll these two confidence intervals differ significantly if the 
posterior is skewed and the effect is large especially for A'off 
if the data-sets chosen are not very constraining. As more data 
sets are considered, degeneracies are reduced, the posterior 
become nearly Gaussian and the two confidence intervals 
become closer. 

If a result of cosmological analysis, such as any evidence 
for iVoff > 3.04, turns out to be driven by any of the effects 
above, it should be interpreted with extra care. 

The dependence on the (arbitrary) choice for the prior, on 
the posterior volume effect in marginalization and the ambi- 
guity of central vs minimal credible interval can be removed 
by making the analysis as prior-independent as possible. 

In the frequentist approach the generalized likelihood ratio 
is widely used to report confidence intervals. The likelihood 
ratio is a well established technique to compute frequentist 
confidence interval when only one parameter is considered, 
but it lacks of a feature equivalent to Bayesian marginaliza- 
tion when dealing with high-dimensional (multi-parameters) 
problems. The generalized (or profile) likelihood ratio on the 
other hand includes such a feature. 

Fortunately, it is possible to extract profile likelihood ratio- 
based confidence intervals from the standard output of cos- 
mological analyses implemented with Markov Chain Monte 
Cai-lo methods liToll . 

To perform our Markov Chain Monte Carlo analysis, we 
use the publicly available CosmoMC package [17]. The 
output from a CosmoMC analysis for a given data set contains 
the likelihood (L) value for each accepted set of parameters. 
Since we are intersted only in constraints of iVcff , we find the 
maximum likelihood (£jv„ii ) value as a function of N^s over 
the parameter space sampled in the chain, without regard to 
the values that the rest of the parameters have. To perform 
this maximization in practice we bin in iVoff with a bin width 
of 0.5. We verified that the results do not depend strongly on 
the choice of binning (as long as bins are not so small that they 
give a very noisy estimate and not so large that they erase any 
signal). We obtain the profile likelihood ratio by considering 
ln(LAr^jj / imax) as a function of A'off; where imax is the 
maximum likelihood in the entire chain. 

Then, always following ifioll . we use the pseudo- 
chisquare defined as In (Lat^jj / Lmax) = 1/2%^' so that 
A In {Ln,!! I imax) = 0.5 and A In (Ljv,j.f / £max) = 2 cor- 
respond to the 68.3% and 95.4% confidence regions respec- 
tively. Finally we report our results in terms of the probability 
for AA'eff = A"cff — 3.04 for each data set considered. The 
correspondence between intervals and probability is done - 
as it is customary in this context- assuming Gaussian statistics; 
the confidence intervals we report therefore could be slightly 
underestimated because of our Gaussian assumption. 



We consider the following data-set combinations: Wilkin- 
son Microwave Anisotropy Probe 7-year data lfl9l I20I1 
(WMAP) in combination with a Hubble constant prior of 
II21I1 . WMAP in combination with the baryon acoustic os- 
cillation constraints from the Sloan Digital Sky Survey data 
release 7 (SDSS DR7) luminous red galaxies (LRG) IH 
(WMAPh-BAO), WMAP in combination with the SDSS DR7 
LRG halo power spectrum of Ref.iH (WMAPh-LRG), and 
the luminosity distance measurements of type lA supernovae 
of Ref. ill] (SN). We also consider high resolution CMB 
experiments for which data, angular power spectra in the 
multipole range of interest and likelihood functions were 
released publicly at the time of writing: ACBAR llzsIl and 
ACT id. In our analysis we keep the Helium mass fraction 
(Yp) fixed to the concordance value. Some analyses llTll[l2ll 
allow Yp to vary. Adding yet another free parameter in the 
analysis opens up extra degeneracies (in particular N^s and 
Yp are anti-correlated), amplifying and possibly exacerbating 
the prior effects that, we suspect, may "bias" the marginalized 
posteriors. 

In our chains we use a flat prior on iJp rather than a flat 
prior on the angular diameter distance to the last scattering 
surface as in the standard CosmoMC implementation. In fact 
the fitting formula used by the code to compute the angular 
diameter distance is accurate for standard number of neutrino 
species but it is not otherwise. In practice, using a flat prior 
on this quantity is equivalent to using a non-flat and somewhat 
complex prior on the true angular diameter distance for non- 
standard number of neutrinos. This effect goes away when 
using a flat prior on iJp and bypassing this analytic fit to 
the angular diameter distance (see e.g., II14I1 for discussion on 
this). For our specific application, using the profile likelihood 
the choice of prior does not matter and indeed we have verified 
that by running chains with both a flat prior on and on 
the angular diameter distance for some selected cases: the 
profile likelihood ratio results are indistinguishable, while the 
marginalized posteriors are not. 



III. RESULTS AND CONCLUSIONS 

We begin by considering WMAP7 CMB data, both alone 
and in combination with large-scale structure and supernova 
datasets. A compilation of the constraints from the different 
datasets we considered is shown in figure [T] Note that for 
all datasets, although the maximum likelihood value is always 
at AiVoff > 0, ATVoff = is always well within the 95% 
error bars. All these combinations have the WMAP7 data 
in common are are hence not at all statistically independent, 
so it is not surprising that if in the WMAP data alone the 
maximum likelihood value is AA^cff > 0, this persists in 
all combinations. As an illustrative example on how these 
error-bars were derived from the profile likelihood ratio, in 
Fig. |2] we show the probability curve obtained from the 
profile likelihood ratio for the combination WMAP7H-LRG. 
This can be compared directly with the results of ifioll . 
We then consider the constraints from data-sets combination 
involving WMAP7 and higher-resolution CMB experiments. 
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FIG. 1: One and two a constraints on the number of additional 
neutrino species from several datasets including combinations of 
WMAP7, LRG, BAO and SN. Note that the value corresponding to 
no extra species, AA^^fr — 0, is always well within the 2-a interval 
and always very close to the l-cr interval. 
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FIG. 2: As an illustrative example we show the probability curve 
obtained from the profile likelihood ratio for the data combination 
WMAP+LRG. The curve is jagged because of the binning procedure 
used to compute the profile likelihood. We have checked that 
changing the bin size (within reason) only slightly changes the shape 
of the curve and does not change the inferred confidence regions. 



WMAP+ACT 
WMAP+BAO+HO+ACT 
WMAP+ACBAR 
WMAP+ACBAR+ACT 
WMAP+ACBAR+BAO 
WMAP+ACBAR+BAO+ACT 
WMAP+ACBAR + BAO+HO 
WMAP+ACBAR+BAO+HO+ACT 




A N,„ 



FIG. 3: Overview of one and two a constraints on extra number 
of neutrino species from several dataset combinations involving 
WMAP7 in combination with higher resolution CMB experiments. 
Note that the point corresponding to no additional relativistic species, 
AA'eff = 0, is always well within the 2-a interval and in few cases 
within the l-cr interval, the fact that all constraints seems to be 
systematically shifted (although by less than l-cr) towards AA^cft > 
is due to the fact that these are not independent constraints: the 
WMAP data, which has a lot of statistical power, is common to all 
combinations of datasets. 




FIG. 4: As an illustrative example we show the probability curve 
obtained from the profile likelihood ratio for one of the data combi- 
nations. 



The constraints from the different dataset combinations are 
summarized in Fig. [3] and an illustrative example of probabil- 
ity curve for one combination is shown in Fig. |4] Again, the 
standard value AA^off = is always well within the 95.4% 
confidence region and in many cases within the l-cr region. 

If we had looked at the marginalized posterior rather than 
the profile likelihood, the following data-set combination 
would have given an almost 2-cr indication of AA'off > 
0: WMAP7, WMAP7+BAO+H0, WMAP7+LRG+H0 (see 
lfl6ll ). Recall also that looking at the marginalized 
posterior, for the combinations CMB+HO+SN+BAO and 
CMB+HO+SN+LSS, H find a 2-cr signal for AA^off > 
and that IH find a 2-cr signal for AA^eff > for WMAP7+ 



ACT+ACBAR+BAO+HO, although using different priors and 
slightly different parameter sets (i.e. the treatment of Yp). 

We can also try to compare with i8|0 and a more recent 
work ll26ll . In these references the authors also allow the 
neutrino masses to vary. Current cosmological data do not 
show any evidence for a non-zero neutrino mass and do not 
have enough statistical power to detect a sum of neutrino 
masses (S) below 0.3 eV (e.g., ii^ IH). Including both 
A^cff and the sum of neutrino masses as free parameters would 
add new degeneracies and exacerbate further the prior volume 
effect in the marginalized posterior distributions, especially 
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since Ncs and E are positively correlated. To compare with 
our analysis we need to consider their constraints for E = 0. 
They find still almost l-a indication of ATVoff > ll29ll . Our 
analysis seems to indicate that this evidence may be driven by 
prior volume effects. 

To summarize, the effective number of neutrino species pa- 
rameterizes any non-standard early-Universe expansion rate. 
Evidence for AA^cff 7^ could be interpreted as extra (sterile) 
neutrino species but also as any light particle that does not 
couple to electrons, the decay of dark matter particles, early 
quintessence and other phenomena. Recent cosmological 
analyses carried out in the Bayesian framework have reported 
hints of AA^cff > 0; extra neutrino species could explain 
some recent claims by particle physics experiments (LSND, 
MiniBoone). In the Bayesian framework, when considering a 
high-dimensional parameter space as in this case, especially 
if the data-sets are not highly constraining and cosmological 
degeneracies are present, the reported marginalized confi- 
dence regions on a parameter can depend strongly on the prior 
choice and on prior volume effects on other parameters. If a 
result of cosmological analysis, such as evidence for AA^cff > 
0, turns out to be driven by any of these effects, it should be 
interpreted with extra care. We have presented a way to make 
the cosmological analysis as prior-independent as possible; 
we borrowed from the frequentist approach the so-called 
generalized likelihood ratio to report confidence intervals. We 
have considered a suite of cosmological data sets and data sets 
combinations and found that prior-independent confidence 



intervals for AA^cff do not show any evidence of additional 
effective neutrino species. Our findings seems to indicate 
that any evidence for AA^off > may be driven by prior 
effects. As better data become available, the likelihood should 
overcome the prior and the posterior should become nearly 
Gaussian, so that this effect should gradually disappear In the 
meantime we advocate the use of the generalized likelihood 
ratio as a useful check of how dependent cosmological results 
are on the choice of priors. 
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